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1. Introduction 

One of the most salient features of complex networks [1, 2, 3, 4] is their scalefreeness: 
they usually exhibit a broad degree distribution falling off as a power law: 

fk~k~\ (1.1) 

with 7 > 2, so that the mean degree is finite. Growing networks with preferential 
attachment, such as the Barabasi-Albert model [5], brought a natural explanation 
for the ubiquity of the observed scalefreeness. These model systems also provide a 
playground to investigate other, more refined features of networks and other random 
growing structures. In this work we focus our attention onto leaders and lead changes. 
Luczak and Erdos [6] already describe as a kind of a race the growth of connected 
components in the Erdos- Renyi model for random graphs [7], and refer to the largest 
component as the leader. This terminology was then introduced in the physics literature 
by the pioneering work of Krapivsky and Redner [8] , in concomitance with the growing 
interest in social networks. As suggested by these authors, as the degree of a node 
may quantify the wealth of a corporation or the popularity of a person, it is natural to 
investigate questions such as How does the identity of the leader change in the course of 
time? What is the probability that a leader retains the lead as a function of time? 

The first and most natural of all quantities pertaining to this area is the degree of 
the leader, i.e., the largest of the node degrees at time n. This quantity is known to 
play a central role in finite-size effects on the degree statistics. The following picture 
has indeed progressively emerged [9, 10, 11]. For a large but finite scalefree network at 
time n, the 'stationary' power law (1.1) is cutoff at some time-dependent scale k*(n), 
which is of the order of the typical value of the largest degree k mSuX (n). The latter can in 
turn be estimated, at least as a first approximation, by means of a well-known argument 
of extreme- value statistics [12]: the stationary probability for the degree k to be larger 
than k mSLX (n) is of order 1/n. In the scalefree case, the largest degree is thus predicted 
to grow as a power law: 

fcmax(n) ~ n", v = — !— -. (1.2) 
7-1 

This growth law is subextensive, as 7 > 2 implies v < 1. 

The goal of the present paper is to provide a systematic study of the distribution of 
the largest degree and of other quantities related to leaders and lead changes in growing 
networks, thus improving and generalizing most of the results presented in [8, 13]. We 
consider network models where a new node enters at each time step, so that nodes can 
be identified by an index equal to the time they enter the network. We adopt the setup 
and some of the notations of our earlier work [11], devoted to finite-size effects on the 
degree statistics. The quantities to be investigated concern either the leader at a given 
time or the whole history of lead changes: 

• The largest degree at time n is by definition the largest of the degrees ki(n) of all the 
nodes at time n (i = 1, . . . , n): 

fcmax(n) = max(/ci(n), . . . , k n (n)). (1.3) 
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We will investigate both the mean value of k max (n) and its full distribution. 

• A co-leader at time n is any node whose degree is equal to the largest degree k mSuX (n). 
We denote by C(n) the number of co-leaders at time n. As the node degrees are integers, 
it is not rare that the largest degree (or any other value) is simultaneously shared by 
two or more nodes. 

• The leader at time n is the node among the co-leaders whose degree reached the value 
fc max (n) first. This definition is already used in [6]. We denote by I(n) the index of the 
leader at time n. Initial conditions will be such that the first node is the first leader. 

• There is a lead change at time n if the leader at time n is different from that at 
time n — 1. We call a lead any period of time between two lead changes. We denote by 
C(n) the number of leads up to time n. In other words, the number of lead changes up 
to time n is C(n) — 1. 

• Finally, some lead changes bring to the lead a node that has already been the leader 
in the past, whereas some other changes promote a newcomer. We denote by D{n) 
the number of distinct leaders up to time n. We also introduce the lead persistence 
probability S(n) as the probability that there is a single leader up to time n. 

We will consider network models where node n attaches to a single earlier node 
(i = 1, . . . , n — 1), with a prescribed attachment probability: 

• Uniform attachment (UA) (Section 2): the attachment probability is independent of 
the node, i.e., uniform over the network. This model has a geometric degree distribution: 
it is therefore not scalefree. Its analysis however allows a detailed introduction of 
key concepts and quantities, such as the discussion of the 'stationary extreme-value 
statistics' approach, given in Section 2.1. 

• Barabdsi- Albert (BA) model (Section 3): the attachment probability is proportional 
to the degree of the earlier node. This well-known model [5] is scalefree, with exponents 
7 = 3 and v — 1/2. 

• General preferential attachment (GPA) (Section 4): the attachment probability is 
proportional to the sum k + c of the degree k of the earlier node and of a constant 
c > — 1. This parameter, representing the initial attractiveness of a node [9], yields the 
continuously varying exponents 7 = c + 3 and v — l/(c + 2). 

Finally, for each of these models, it will be interesting to simultaneously analyze 
data for two different initial conditions [11]: 

• Case A. Node 1 appears at time n — 1 with degree fci(l) = 0. At time n = 2 node 2 
attaches to node 1, so that k±(2) = k 2 (2) = 1. At time n = 3 node 3 attaches a priori 
either to node 1 or to node 2 with equal probabilities. We choose to attach node 3 to 
node 1, obtaining thus fci(3) = 2, whereas ^(3) = ^3(3) = 1, so that node 1 is the first 
leader. This is the initial condition used e.g. in [8]. 

• Case B. Node 1 appears at time n = 1 with degree ki(l) = 1. This amounts to saying 
that the first node is connected to a root, which does not belong to the network. It is 
natural to represent this connection by half a link. At time n = 2 node 2 attaches to 
node 1. We thus have k±(2) = 2 and £2(2) = 1, so that node 1 is again the first leader. 
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The sum of the degrees fcj(n) of all the nodes at time n equals twice the number of 
links L(n) in the network [11, Eq. (1.3)]: 

n 

5>(n) = 2L(n), (1.4) 
i=i 

with 2L^\n) = 2n — 2 and 2L^ B \n) — 2n — 1. Here and in the following, the super- 
scripts (A) and (B) denote a result which holds for a prescribed initial condition (Case A 
or Case B). 

2. The uniform attachment (UA) model 

In the uniform attachment model, each new node n attaches to any earlier node 
(i — 1, . . . , n — 1) with uniform probability: 

2.1. Largest degree 

The stationary degree distribution in the UA model has a geometric form: 

/* = 2- fc . (2.2) 

It is useful to recall the derivation of this result. The distribution fk(n,i) of the degree 
ki(n) of node % at time n obeys the recursion [11, Eq. (2.11)] 

nf k (n + 1, i) = fk-i(n, i) + (n - l)fk(n, i), (2.3) 

with fk(i,i) = Sk,i for i > 2. The distribution of the degree of an arbitrary node, 

fk(n) = -itfk(n,t), (2.4) 
n i=i 

therefore obeys the recursion 

(n + l)f k (n + 1) = / fc _x(n) + (n - l)/ fc (n) + 4,i- (2-5) 

Finally, the stationary degree distribution fk (corresponding to n — > oo at fixed fc) obeys 
the recursion 

2/ fc = + (2.6) 

which easily yields (2.2). 

Applying the extreme-value argument recalled in the Introduction to the geometric 
distribution (2.2), we recover the logarithmic growth of the largest degree [8]: 

, > In rh . . 

fcmax(n) ~ (2.7) 

One could be tempted to go further along this line of reasoning and to assert that the 
full distribution of the largest degree k mgx (n) can be derived, asymptotically for n large, 
from the stationary extreme-value statistics approach. This consists in modeling /c max (n) 
as the largest of n independent variables fcj drawn from the stationary distribution fk- 
As already suggested in [13], this approach cannot however be exact for the following 
three reasons, in order of increasing importance: 
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(1) Sum rule. The degrees fcj(n) are not independent random variables, as they obey 
the sum rule (1.4). This condition is however harmless for large n, as it imposes a 
single constraint on n variables. 

(2) Finite-size (i.e., finite-time) effects. The distribution fk{n) of the degree of a node 
at time n does not coincide with its stationary value It is indeed affected by 
finite-size effects, which have been studied in detail in [11]. As recalled in the 
Introduction, these effects become important for k ~ ~ k max (n). Finite-size 
effects may therefore alter the statistics of the largest degree. 

(3) Correlation between degree and index. The degrees ki(n) of the nodes are not 
identically distributed. In particular, older nodes typically have larger degrees. 
Their distribution fk{n,i) may depend rather strongly on the node index i, 
especially for i <n. 

Points (2) and (3) raise rather delicate issues, which deserve a specific investigation 
for each model. The UA model is a rather fortunate case where the above points are to 
a large extent under control [11, Sec. 2]. Concerning point (3), the dependence of the 
distribution f k (n,i) on the index i is rather weak. We have for instance [11, Eq. (2.4)] 

77 

(kAn)) w In - + 1 (2.8) 

i 

for 77 and i large. The leading logarithmic growth of this expression is independent 
of the node index i, which only enters the 'finite part' of the logarithm. Concerning 
point (2), i.e., finite-size effects, the ratio between the true distribution fk{n) for n finite 
and its stationary counterpart fk stays approximately equal to unity, before it drops 
to zero in a range of degrees scaling as (Inn) 1 / 2 around k ~ 2 In 77. The prefactor 2 of 
this estimate is larger than the prefactor 1/ In 2 m 1.442695 of the law (2.7), so that the 
distribution fk(n) is expected to be closer and closer to the stationary one for degrees k 
near the mean largest degree. We are therefore tempted to argue that the 'stationary 
extreme- value statistics' approach, or stationary approach for short, provides a good 
description of the distribution of the largest degree k max (n) and of related observables, 
which may even yield exact asymptotic results, at least for quantities which are only 
sensitive to the leading logarithmic growth of expressions like (2.8), and not to their 
'finite parts'. 

The statistics of extremes for i.i.d. integer variables is reviewed in Appendix A, 
with emphasis on geometric distributions. The exact distribution of the largest degree 
is given by (A. 8), whereas a simpler expression valid for large 77 is provided by the 
'discrete Gumbel law' (A. 9). The logarithmic growth (A. 20) of the mean largest degree 
agrees with (2.7) for a = 1/2, whereas fluctuations are typically of order unity. 

We now compare the predictions of the stationary approach to the outcomes of 
direct numerical simulations of the model. Throughout this section on the UA model, 
numerical data are obtained by averaging over 10 6 different histories, i.e., stochastic 
realizations of the network, for both initial conditions (Cases A and B), up to time 
77 = 10 6 , unless otherwise stated. Statistical errors are smaller (and usually much 
smaller) than symbols. 
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Figure 1. Plot of the finite part of the mean largest degree, (fc max (n)) — Inn/ In 2, 
against Inn. Circles: numerical data for the UA model with both initial conditions. 
Squares and horizontal line: exact stationary values derived from (A. 8) and their limit 
(see (A.20)). 



Figure 1 shows a plot of the finite part of the mean largest degree, (k max (n)) — 
lnn/ln2, obtained by subtracting from the data the leading logarithmic law (2.7), 
against Inn. The figure also shows the corresponding exact stationary finite part 
(squares), derived from (A. 8). The latter converges very fast to the limit 7 E / In 2 + 1/2 w 
1.332746 (horizontal line), given by (A.20), where 7 E « 0.577215 denotes Euler's 
constant. The numerical data (circles) are observed to grow very slowly with n. They 
however seem to converge to asymptotic values in the same ball park as the above limit 
of the stationary finite part. Figure 2 shows a plot of the full distribution of k mSuX (n), 
for three values of time n. Only data for Case B are presented, for the sake of clarity. 
The exact stationary distributions (A. 8) are plotted for comparison. The agreement 
between numerical data and analytic predictions of the stationary approach is observed 
to improve for larger n. The variability observed in the shape of the distribution near 
its top is explained in Appendix A in terms of periodic oscillations. 

2.2. Number of leads 

The number of leads C(n) up to time n can be estimated by elaborating on the 
observation [8] that lead changes are driven by the presence of co-leaders. In the present 
situation, this line of reasoning yields a quantitative result. The probability for a lead 
change to take place at time n + 1 is indeed proportional to C(n) — 1, the number of 
co-leaders at time n which are not the leader. Furthermore the (n + l)-st node attaches 
to each of those co- leaders with probability l/n. We have therefore the exact relation 

(C(n+l)}-(C(n)}= {C{n)) ~ 1 . (2.9) 

n 

Within the stationary approach, the mean number of co-leaders is shown in Appendix A 
to have a finite limit (A. 29) for large n. For a = 1/2 we obtain (C) = l/ln2. 
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Figure 2. Plot of the distribution of the largest degree fc max (n) in the UA model for 
n = 10 3 , 10 4 and 10 5 . Numerical data with initial condition B (circles) are compared 
to the exact stationary expression (A. 8) (squares). 
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Figure 3. Plot of the mean number (£(n)) of leads against Inn, for the UA model 
with both initial conditions. The full line has the theoretical slope Ac = l/ln2 — 1» 
0.442695. 



According to the discussion of Section 2.1, this leading-order result can be trusted to 
be asymptotically exact for large n. Hence 

(C(n)) « A c Inn, A c = ^ - 1 ~ 0.442695. (2.10) 

Figure 3 shows a plot of (C(n)) against Inn for both initial conditions. The data 
are observed to follow the logarithmic law (2.10), albeit with appreciable finite-time 
corrections, especially for Case A. 
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2. 3. Index of the leader 

The study of the mean index (I(n)) of the leader at time n also follows the line of 
reasoning of [8]. The essential ingredient is the mean index of a node of degree k. This 
quantity can be derived by the following approach, which does not require any a priori 
knowledge of asymptotic degree statistics. The mean index of a node of degree k at 
time n reads 

fk{n) 

where the denominator fk{n) has been introduced in (2.4), whereas the numerator, 

0kM = -£i/ fc (M), (2.12) 
n 1=1 

obeys the recursion 

in + l)g k (n + 1) = g k ^(n) + (n - l)g k (n) + (n + 1)<S M . (2.13) 
In the stationary regime (n — > oo at fixed k), we have gk{n) ~ wyk, with 

37fc = 7fc-i + <*M> ( 2 - 14 ) 



7fe = 3- fc (2.15) 



(i fc ) » ( ^ n. (2.16) 



hence 



and finally 



,3, 

The mean index of the leader can be estimated by replacing k in (2.16) by the 
logarithmic law (2.7) giving the typical largest degree. We are thus left with [8] 

(I(n)) ttAm^, ip = 2 - w 0.415037. (2.17) 

The full-fledged stationary approach allows us to derive a better approximation of (I(n)), 
including a numerical estimate for the amplitude. Within this framework, averaging 
the expression (2.16) over the stationary extreme- value statistics of k ma , x (n) yields 
(I(n)) nG n (s), with s = ln(2/3) and a = 1/2, where the generating function G n (s) 
is defined in (A. 12). The expressions (A.15)-(A.19) corroborate the power law (2.17), 
and yield the estimate A IjBtajk = T(ln3/ln2 - l)/(31n2) ^ 0.733240 for the amplitude 
(with 'stat' for 'stationary'). 

Figure 4 shows a plot of the ratio (I(n))/n^ against Inn, for both initial conditions. 
Both series of data seem to converge, albeit very slowly, to a common limit Aj m 0.68 
(full line), in the same range as the stationary estimate Az> tat . 
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Figure 4. Plot of the mean index of the leader (I(n)), for the UA model with both 
initial conditions, divided by the theoretical power law rv', against Inn. The horizontal 
line shows a plausible common limit around A\ w 0.68. 

2.4- Lead persistence probability 

The lead persistence probability S(n) is the probability that the first node keeps the 
lead up to time n. It is suggested in [8] that S(n) can be estimated as the probability 
that the degree k\(n) of the first node is equal to the mean largest degree, i.e., 
(^max(n)) ~ Inn/ In 2 (see (2.7)), up to negligible finite fluctuation. Elaborating on 
this line of thought, we propose to give a better derivation of S(n) as the probability 
that the degree fci(m) of the first node has been equal to the mean largest degree for all 
times m up to the current time n. This improvement takes into account the fact that a 
persistence phenomenon is intrinsically non-local in time. 

The distribution of the degree k\(n) of the first node has been derived in 
several works. Consider Case A for definiteness. The corresponding generating series 
reads [8, 11] 

so that we have 

fkM = fz^ F ^ x) - (2 ' 19) 
In a first step, let us consider the probability for ki(n) to be very different from its 
mean value at time n (see (2.8)), namely k\{n) m blnn with 6^1. This probability can 
be estimated by applying the saddle-point approximation to the above contour integral. 
We thus obtain a logarithmic large-deviation estimate of the form 

/MnnM) ~ (lnn)- 1/2 n" $ W, $(6) = 1-b + bhxb, (2.20) 

with $(6) (b — l) 2 /2 as b — > I. The above estimate holds irrespective of the initial 
condition. In the case of interest, i.e., b = l/m2, the exponent reads <fi = $(l/m2) m 
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Figure 5. Log-log plot of the lead persistence probability S(n) against time n, 
Symbols: Data for the UA model with both initial conditions. Dashed and full line: 
asymptotic power law and full estimate (2.21), with arbitrary numerical prefactors 
chosen for readability. 



0.086071 [8]. In a second step, we estimate the lead persistence probability as the 
probability that k\{m) has been of order blnm for all times m up to time n, and not 
only at the current time n. The effect of such a condition has been investigated in detail 
in the case of the one-dimensional lattice random walk [14]. As a general rule, the mere 
effect is to change the exponent of the 'prefactor' from 1/2 to 3/2. We are therefore left 
with the estimate 

S(n) ~ (lnn)- 3/2 n-^. (2.21) 

Figure 5 shows a log-log plot of the lead persistence probability against time n, for 
both initial conditions and n up to 10 6 . The data points are far from being aligned, 
indicating thus a strong deviation from a pure power law [8]. Moreover, the effective 
slope over the plotted data range is around 0.3, i.e., four times larger than the theoretical 
asymptotic exponent (dashed line). The full estimate (2.21) (full line), including the 
prefactor (lnn)~ 3//2 , gives a much better representation of the data. 

2. 5. Number of distinct leaders 

The mean number of distinct leaders up to time n is expected to grow logarithmically: 

(D(n)) ~ Ad Inn. (2.22) 

Two heuristic reasons can be given in favor of this law. First, the number of lead changes 
up to time n, C(n) ~ Inn, grows much less rapidly than the number of candidates to the 
lead, which can be estimated as I(n) ~ r$ . It is therefore likely that any lead change 
brings a newcomer to the lead, with some non-zero probability IT. Second, the growth 
of the largest degree k max (n) ~ Inn is modest, so that newcomers have 'a short way to 
go' before they can reach the lead. 
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Figure 6. Plot of the mean number (D(n)) of distinct leaders up to time n, against 
time n, for the UA model with both initial conditions. The full line has slope 
A D w 0.29. 



Figure 6 shows a plot of the mean number (D(n)) of distinct leaders, against 
time n, for both initial conditions. The picture is somewhat similar to that of Figure 3, 
with appreciable finite-time corrections, especially for Case A. The data confirm our 
expectation, namely an asymptotic logarithmic law of the form (2.22), irrespective of 
the initial condition. The slope A D ps 0.29 of the full line is the outcome of a fit including 
a correction in 1/n. The ratio of A D to the amplitude A c of the growth law (2.10) of the 
mean number of leads yields the probability for any lead change to bring a newcomer 
to the lead: 

n = 4^ ~ 0.65. (2.23) 
Ac 

3. Linear preferential attachment: the Barabasi-Albert (BA) model 

In the Barabasi-Albert (BA) model, node n attaches to any earlier node with a 
probability proportional to the degree of that node. The attachment probability of 
node n to node % thus reads 
_ kj(n - 1) 

Pn ' l ~ Z(n-l)' { ' 

where the partition function in the denominator ensures the normalization of the 
probabilities. It reads Z(n) = 2L(n) (see (1.4)), i.e., Z {A \n) = 2n-2, Z {B \n) = 2n-l. 

3.1. Largest degree 

The stationary degree distribution of the BA model can be derived along the lines of 
Section 2.1. The distribution fk(n, i) of the degree of node i at time n obeys the recursion 

Z(n)f k (n + l,i) — (k — l)/ fc _i(n,i) + (Z(n) - k)f k (n,i), (3.2) 
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with fk(i,i) = S k ,i for i > 2. The distribution of the degree of an arbitrary node 
therefore obeys the recursion 

(n + l)Z(n)f k (n + 1) = n(k - l)/ fc _i(n) + n(Z(n) - k)f k (n) + <S M . (3.3) 

Finally, the stationary degree distribution f k on an infinite network obeys the recursion 

(fc + 2)/ fc = (fc-l)/ fc _ 1 + 2<f M , (3.4) 

hence 

4 

/fc = fc(jfe + l)(Jfc + 2)- (3,5) 
For a large degree k, the stationary distribution and the corresponding cumulative 
distribution scale as f k ~ 4/A; 3 and F k ~ 2/A; 2 . Applying the extreme-value argument 
to the latter estimate, we predict that the largest degree grows as 

fcmax(n) ~ n 1/2 . (3.6) 

In our quest of the distribution of the largest degree k max (n), our first approximation 
relies on stationary extreme-value statistics. We recall that this approach cannot 
be exact, for the three reasons exposed in Section 2.1. Within this approach, the 
distribution <fi k = Prob{A; max (n) = k} of the largest degree can be estimated by means 
of (A. 6). The first term (C = 1) is clearly leading for k large, where f k is small. We 
thus obtain a distribution of the form <fi k m nf k exp(—nF k ), i.e., 

(3.7) 

In other words, the ratio 

Y(n) = (3.8) 

is predicted to have a non-trivial asymptotic distribution py. Our first approximation 
to the latter distribution, given by the stationary approach, reads 

pyMy) « '■ (3 - 9) 

This is the Frechet distribution known in extreme- value statistics [12]. We have 
in particular (F) sta t = (2-k) 1 / 2 w 2.506628, whereas the second moment (F 2 ) stat is 
divergent. This feature of the stationary approach is clearly incorrect. Indeed finite-size 
effects induce a cutoff, beyond which the degree distribution falls off very fast. 

Our second approximation consists in incorporating these finite-size effects. This 
approach is however still inexact, as it complies with point (2) of the discussion of 
Section 2.1, albeit not with point (3), concerning the dependence of the degree of the 
node on its index. Finite-size effects on the degree statistics have been investigated in 
detail in [11]. The distribution f k (n) of the degree k at time n and the corresponding 
cumulative distribution F k {n) scale as 

F fc (n)w^*(y), V = ^ (3-10) 
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with $(y) = — (y/2)^'(y). Both scaling functions depend on the initial condition. 
The known exact expressions of $(?/) [11, Eq. (3.47)] yield 




(3.11) 



$ (A) (y) = erfc 
yW( y ) = 

$(B) (y ) = 
*CB)( y ) = 

Our second approximation to the asymptotic distribution of Y, given by the stationary 
approach including finite-size effects, reads therefore 

Py,PSs(r)-^pe--(^ 2 (3.12) 

(with 'FSS' for 'finite-size scaling'). As expected, this expression is better behaved than 
the Frechet distribution (3.9). It falls off as exp(— Y 2 /A), so that all the moments of Y 
are finite. From a quantitative viewpoint, a numerical evaluation of the appropriate 
integrals yields the estimates (^)fss ~ 

1.842146 and (Y) ( ^ s fx 1.966678. It is also of 
interest to get an estimate for the asymptotic value of the reduced variance 

var/c max (n) varF(n) 

v(n) = l^M? = W' (3J3) 

We obtain similarly v^g S fx 0.161205 and v^ s fx 0.179513. 

Figure 7 shows a plot of the mean rescaled largest degree (Y(n)) = (k max (n)) /n 1 / 2 
(left) and of the corresponding reduced variance v(n) (right), against n^ 1 ^ 2 , for both 
initial conditions. Throughout this section on the BA model, simulations are realized 
by using the redirection algorithm proposed in [15] and recalled in the beginning of 
Section 4. Data are averaged over 10 6 different histories up to time n = 10 5 . For both 
quantities shown in the figure, as well as other ones to be presented below, the data 
nicely converge to asymptotic values which can be measured accurately, with leading 
corrections proportional to \/k+(n) ~ n~ x l 2 . An explanation for this form of corrections 
will be given in Section 3.3. The measured asymptotic values read (Y)^ fx 2.00, 
(y)g) fx 2.16, v£) fx 0.108 and fx 0.111. The estimates (F)fss for the mean degree 
are satisfactory: they are only some 10% below the observed values, and correctly predict 
that (Y)^ is larger than (Y)^\ The approach is however less accurate for the reduced 
variance, as the estimates (i>)fss are too large at least by some 50%. Figure 8 shows a 
histogram plot of the distribution of the rescaled largest degree Y{n). The large value of 
time (n = 10 5 ) ensures that the data virtually follow the asymptotic distributions 
and py" 1 ■ Data have been sorted in bins of width 5k = 30 ~ (k uiax )/20, as (fc max )( A ) ~ 631 
and (A; max )( B ) fx 684. 
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Figure 7. Statistics of the largest degree in the BA model for both initial conditions. 
Left: plot of (Y(n)) = (fc max (n))/n 1/ ' 2 against n -1 / 2 . Right: plot of the reduced 
variance v(n) against n -1 / 2 . The data have the asymptotic values (Y)^ w 2.00, 
(Y)£ ] w 2.16, w 0.108 and w 0.111. 




Figure 8. Plot of the 'asymptotic' (n = 10 5 ) distribution py of the rescaled largest 
degree Y, for the BA model with both initial conditions. Data are sorted in bins of 
width 5k = 30. The difference between both initial conditions is small but significative. 



3.2. Number of leads 

The number of leads C(n) up to time n can again be estimated by considering the effect 
of co- leaders. The probability of a lead change is still proportional to C(n) — 1, whereas 
each co-leader now has a probability k max (n)/(2n) to attach the (n + l)-st node. 

Within the framework of stationary extreme- value statistics, with fj, m 4/k 3 and 
Fk 2/k 2 , the expression (A. 6) shows that the probability of having C = 2 is small, 
whereas that of having C = 3 is still smaller, and so on. In other words, with respect 
to the UA model, co-leaders are now more rare, but the few ones that are present are 
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15 




Figure 9. Plot of the mean number (C(n)) of leads against Inn 
with both initial conditions. The full lines have slopes 



for the BA model 
0.39. 



0.45 and A ( p 



more efficient. We thus obtain the estimate 

(C(n + 1)) - (C(n)) « p Q fl exp(-nF fc ) A ( 3 .14) 

For a large time n, using the above estimates, the right-hand side can be shown to boil 
down to l/(2n). We thus obtain the same logarithmic growth for the mean number of 
leads as for the UA model [8]: 

(£(n)) « A c Inn. (3.15) 

Furthermore the stationary approach yields the estimate Ac,st&t = 1/2 for the amplitude. 

Figure 9 shows a plot of (C(n)) against Inn for both initial conditions. The data are 
observed to follow the logarithmic law (3.15), with two different amplitudes, A C A ^ m 0.45 
and A c 0.39, in the same range as the estimate Ac iStat of the stationary approach. 

3. 3. Index of the leader 

The essential ingredient is again the mean index of a node of given degree k. The 
numerator gk{n) of the expression (2.11) can be shown to obey the recursion 

(n+l)Z(n)g k (n+l) = n(fc-l)^ fc _i(n)+n(Z(n)-A;)/ fc (n)+(n+l)(J M .(3.16) 

In the stationary regime, we have again gk(n) ~ njk, with 

(k + 4) 7fc = (fc-l) 7jfe _ 1 + 24,i, (3.17) 

hence 



and finally 



lk k(k + l)(k + 2)(k + ?,){k + A) (3 ' 18) 



. . . 12n . 
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0.05 0.1Q 0.15 0.20 
n -l/2 

Figure 10. Plot of the mean index of the leader (I(n)) against n -1 / 2 , for the BA 
model with both initial conditions. The asymptotic values read (1)^ ~ 3.40 and 
(/>£> « 2.67. 

Replacing k in this expression by the power law (3.6) for the typical degree of the leader, 
we obtain a result of order unity. We are thus led to the conclusion that the index of 
the leader typically stays finite, and especially that its mean value has a finite limit 
(/)oo- Within the framework of stationary extreme- value statistics, evaluating (7) stat as 
(12n//c 2 ) using (3.7), we obtain the estimate (i%tat — 6. Finally, as the expression (3.19) 
admits an expansion in powers of 1/k, the stationary approach suggests that the leading 
finite-time corrections to the limit (I)oo ar e proportional to l/k max (n) ~ n" 1 / 2 . 

Figure 10 shows a plot of the mean index of the leader (I(n)) against n _1//2 , for both 
initial conditions. The data converge to well-defined asymptotic values (1)^ ~ 3.40 
and (1)^ ~ 2.67, whereas the leading corrections are observed to be linear in n -1 / 2 . 

3.4- Number of distinct leaders and lead persistence probability 

The behavior of these last two quantities can be deduced from what is already known 
by means of the following line of reasoning. The number D(n) of distinct leaders 
up to time n is at most equal to the largest index of the leader up to time n, 
/ max (n) = max(7(l), . . . , I(n)). Loosely speaking, D{n) cannot grow typically faster 
than the index of the leader. It is therefore expected to stay of order unity, and to 
share with J(n) the property of having a finite asymptotic mean (D)^. More generally, 
the whole distribution of the number of distinct leaders is expected to converge to a 
non-trivial distribution p^> in the limit of long times. Finally, as the lead persistence 
probability S(n) is nothing but the probability that D{n) equals one, it is also expected 
to have a finite limit = Pd(1)- 

Figure 11 shows a plot of the mean number (D(n)) of distinct leaders (left) and of 
the persistence probability S(n) (right) against n -1 / 2 , for both initial conditions. The 
data converge to the asymptotic values (D)£> « 2.22, (£>)£) « 1.94, S£> « 0.279 and 
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Figure 11. Left: plot of the mean number (D(n)) of distinct leaders against 



-1/2 



Right: plot of the lead persistence probability S(n) against n 



-1/2 



n 

for the BA model with both initial conditions have asymptotic values (D)^ 
(D)^ w 1.94, w 0.279 and w 0.389. 



Data 
2.22, 




D 

Figure 12. Plot of the 'asymptotic' (n = 10 5 ) distribution pu of the number D of 
distinct leaders, for the BA model with both initial conditions. 



~ 0.389. Figure 12 shows a plot of the full distribution of the number of distinct 
leaders for n = 10 5 and both initial conditions. The plotted data are virtually equal to 
the asymptotic distribution p D . 

4. The general preferential attachment (GPA) model 

In the general preferential attachment (GPA) model, the probability of attachment to a 
node is proportional to the sum of the degree of that node and of a constant parameter 
c > — 1, representing the initial attractiveness of a node [9]. This parameter is relevant, 
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as it yields the continuously varying exponents 

T = c + 3, (4-1) 

The BA and UA models are respectively recovered when c = and c — > oo. 

More precisely, in the GPA model the attachment probability of node n to node % 
reads 

_ kj(n-l) + c 

P»>*- Z (n-1) ■ 1 ] 

The partition function in the denominator reads Z(n) = 2L(n)+cn. It therefore depends 
on the initial condition according to 

Z (A) (n) = (c + 2)ra-2, Z^(n) = (c + 2)n - 1. (4.3) 

The stationary degree distribution of the GPA model can be derived along the lines 
of Section 2.1. The distribution fk(n,i) of the degree fcj(n) of node i at time n obeys 
the recursion 

Z(n)/ fc (n + 1, i) = (k + c - l)/ fc _i(n, i) + (Z(ra) - fc - c)/ fc (ra, i), (4.4) 

with fk(i,i) = Sk,i for i > 2. The distribution of the degree of an arbitrary node 
therefore obeys the recursion 

(ra+l)Z(ra)/ fc (ra+l) = n(fc+c-l)/ fc _i(n)+n(Z(n)-A;-c)/ fc (n)+<J M .(4.5) 

Finally, the stationary degree distribution f k obeys the recursion 

(k + 2c + 2)f k = (k + c- l)/ fc _x + (c + 2)4,i, (4.6) 

hence [9, 16] 

(c + 2)r(2c + 3)r(A; + c) 
Jfc r(c+l)r(A; + 2c + 3) ' 1 ' 

This expression has a power-law decay at large k: 

/^ (c+ r f^ +3) r^,. (4 . 8) 
r( c + 1) 

In the regime where fc and c are both large and comparable, the expression (4.7) assumes 
the scaling form [11, Eq. (4.29)]: 

/ fc ~exp(-c0(/c)), (4.9) 

with k = k/c and 

(j>( K ) = (« + 2) ln(K + 2) - (k+ l)ln(K+ 1) -21n2. (4.10) 

The linear behavior 0(/t) w /tin 2 as k — > matches the exponential fall-off (2.2) of the 
UA model, whereas the logarithmic growth 0(/t) « lnre + 1 — 21n2as/t^oo matches 
the power-law decay (4.8). 

The GPA model can be simulated very efficiently (in time n) by means of the 
following redirection algorithm [15]. At time n, in order to attach node n, an earlier 
node i = 1, ... ,n — 1 is chosen uniformly, and node n is attached either to node i 
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itself with probability 1 — is, or to the ancestor a(i) of node i with probability v. The 
ancestor of % is the node a{%) — 1, . . . , % — 1 to which node % has attached at time %. The 
attachment probabilities (4.2) are recovered if the redirection probability v is chosen 
to be equal to the growth exponent (4.1), hence the notation. In particular, the linear 
attachment probabilities of the BA model are recovered when the redirection probability 
is v — 1/2. Initial conditions are implemented by requiring that some 'primordial nodes 
have no ancestor [15]. For Case A, nodes 1 and 2 have no ancestor, whereas node 1 is 
the ancestor of node 3. For Case B, node 1 has no ancestor, whereas it is the ancestor 
of node 2. The numbers of primordial nodes can be read off from the expressions (4.3) 
of the partition function: these are the integers which appear after the minus signs. 

For any finite c > —1, i.e., any value < v < 1 of the growth exponent, the 
GPA model behaves in every respect similarly to the BA model (corresponding to 
v = 1/2), investigated in Section 3. The largest degree k mSuX (n) grows as n v . Setting 
fc max (n) = Y(n) n", the rescaled variable Y{n) has a limiting distribution p Y , which 
depends on v and on the initial condition. The mean (Y(n)) and the reduced variance 
v(n) converge to finite limits (Y)^ and v^, with leading corrections proportional to 
l//c*(n) ~ n~ u . The mean number of leads scales as (C(n)) « Ac Inn, where the 
amplitude A c depends on v and on the initial condition. Finally, the mean index of the 
leader, the mean number of distinct leaders, and the lead persistence probability have 
finite limiting values (J)oo, (D)<x> an d 5*00. The leading corrections to these limits are 
again in n~ u . The convergence is therefore very slow at small u, where a crossover to the 
UA model is observed (see below). In practice limiting values cannot be extrapolated 
from data for reasonable times (n = 10 5 - 10 6 ) with enough accuracy for v < 0.1. 

The dependence of the key asymptotic quantities (/)oo, (-D)oo, 5oo and on 
the growth exponent v for both initial conditions is shown in Figure 13. The data 
corresponding to the BA model, discussed in Section 3, are recovered for v = 1/2, i.e., 
right in the middle of the plots. The behavior at both endpoints also deserves some 
attention. 

• v — > 1, i.e., c — > —1. In this limit, the GPA model boils down to a ballistic growth 
model. Indeed the attachment probability k + c vanishes for k — 1, i.e., the initial 
degree of each node. As a consequence, with our conventions on the initial states, the 
first node attracts all the subsequent ones. Its degree grows ballistically as fci(n) ~ n, 
whereas its lead keeps undisputed. This picture explains the trivial limiting values of 
the key quantities ((/)oo — > 1, (-D)oo — > 1, ^oo — > 1, Voc — > 0). The leading corrections 
to these limits are observed to be proportional to c + 1, i.e., to 1 — v. 

• v — > 0, i.e., c — > oo. This regime is more subtle than the previous one. Consider 
a very large value of the parameter c ~ \jv. The scaling behavior (4.9) expresses 
that the static degree distribution first falls off exponentially, as in the UA model (for 

-C c), and then crosses over (for k ~ c) to a power-law fall-off with a large exponent 
(for k ^> c). Assume now that the degree k under consideration grows as the largest 
degree k max (n) ~ Inn in the UA model (see (2.7)). The above crossover then takes 
place at an exponentially large time n, such that Inn becomes comparable to c. As a 
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Figure 13. Plot of the four key asymptotic quantities of the GPA model with 
both initial conditions: 1x1(1)^, (D)^, — lnS^ and i^, against the growth exponent 
v=l/(c + 2). 



consequence, a rough estimate of the behavior of the key quantities as v — > can be 
obtained by replacing Inn by l/v in the large-time behavior of the same quantities for 
the UA model. We have seen in Section 2 that (I{n)) grows as a power law (see (2.17)), 
whereas S(n) falls off as a power law (see (2.21)), and (D(n)) grows logarithmically 
(see (2.22)). Finally, (k max (n)) grows logarithmically, whereas var& max (n) stays finite, 
hence v(n) ~ l/(lnn) 2 . We thus obtain the estimates 

ln(/)oo ~ (D)oo ~ —In ^ ~ Voo ~ v 2 . (4.11) 

up to multiplicative constants which are not predicted by the crossover argument. In 
spite of their crudeness, the above estimates are in qualitative agreement with the data 
shown in Figure 13. The first three quantities indeed exhibit a similar kind of divergence 
at small is, reaching values of order 4-6 for v = 0.1. The data for the reduced degree 
variance i>oo are observed to vanish faster than linearly as v — > 0, and to be compatible 
with a quadratic law (dashed line). Finally, since the reduced variance vanishes at 
both endpoints, it has a non-trivial maximum (f^ max ~ 0.120 and f^ max ~ 0.118). 
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5. Discussion 

We have presented a comprehensive study of the statistics of leaders and lead changes 
in growing networks with preferential attachment. The quantities investigated in this 
work concern either the leader at a given time (degree and identity of the leader, number 
of co-leaders) or the whole history of lead changes (number of lead changes, of distinct 
leaders, lead persistence probability). So far very few works had been devoted to the 
subject of leaders in this sense, either in growing networks [8, 13] or in other random 
structures [6, 17]. The present paper is also to some extent a continuation of our 
previous work [11] on finite-size effects in the degree statistics. Both articles are meant 
to be systematic and consistent with each other. In particular, the same three different 
attachment rules (UA, BA, GPA) and the same two different initial conditions (Cases A 
and B) are dealt with in parallel in both works. 

The first and most natural quantity of this area is the largest degree k max (n), i.e., 
the degree of the leader at time n. As recalled in the Introduction, there are (at least) 
three ways of defining a time- dependent characteristic degree scale in a scalefree network 
(see e.g. [9, 10, 11]): 

(i) the cutoff scale k*(n), beyond which the stationary degree distribution is strongly 
affected by finite-size effects; 

(ii) the largest degree k mSuX (n); 

(iii) the degree ki(n) of a node with any fixed index, say the first one (i = 1) for 
definiteness. 

These three degree scales have the same growth law as a function of time n, i.e., a 
logarithmic growth for the UA model, and a power law with exponent v in the scalefree 
case. The agreement between definitions (ii) and (iii) is rather natural, as the first 
node has the highest chance of being the leader. The agreement between definitions (i) 
and (ii), although less obvious, can be explained within the framework of what we have 
called 'stationary extreme- value statistics'. Although this approach cannot be exact (see 
Section 2.1), it however turns out to correctly predict the order of magnitude of k max (n) 
by the condition that the stationary probability for the degree k to be larger than 
k max (n), i.e., the cumulative probability F k , be of order 1/n. This criterion implies that 
fc max (n) is comparable to k+(n), where Fk drops more or less suddenly from large to small 
values. The above heuristic picture can be complemented with quantitative results. In 
the UA model, the three degree scales grow logarithmically with time n, albeit with 
different (exactly known) prefactors, and different scalings for fluctuations. The degree 
of the first node is asymptotically distributed according to a Poissonian law [8, 11, 15] 
with (ki(n)) ~ lnn + 7 E . The largest degree grows as k m£LX (n) m lnn/m2 (see (2.7)), 
whereas fluctuations around this typical value are of order unity. They are indeed given 
by a 'discrete Gumbel law', studied in Appendix A. Finally, the finite-time cutoff in 
the degree distribution takes place around ~ 2 Inn, in a narrow range whose 

width scales as (Inn) 1 / 2 [11]. In the BA model, the above degree scales grow as n 1//2 , 
whereas fluctuations remain broad, i.e., of order unity in relative value. The degree of 
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Quantity 


<>0 CO 




Woo 


(D)oo 




Case A 


2.00 


0.108 


3.40 


2.22 


0.279 


Case B 


2.16 


0.111 


2.67 


1.94 


0.389 


A(%) 


7 


3 


21 


13 


28 



Table 1. Dependence on the initial condition (Case A or Case B) of the asymptotic 
large-time values of quantities of interest for the BA model, obtained by an accurate 
extrapolation of numerical data (see Section 3): mean reduced largest degree (Y)oo, 
reduced variance of the largest degree, mean index (I)oo of the leader, mean number 
(£>)oo °f distinct leaders, and lead persistence probability Soo. The last row gives the 
relative difference between data for both initial conditions, expressed as a percentage 
of the larger figure. 

the first node is asymptotically distributed according to a geometric law [8, 11, 15] with 
(ki(n)) ps n 1 / 2 . The largest degree scales as k max (n) ~ Yn 1 ^ 2 , where the rescaled 
variable Y has a non-trivial limiting distribution py, which depends on the initial 
condition (see Figure 8). This distribution is qualitatively different from the Frechet 
distribution predicted by 'stationary extreme-value statistics', whereas the Gumbel 
distribution put forward in [13] cannot be more than a good fit of over some range. 
Finally, the cutoff scale is also smeared into a non-trivial finite-size scaling law, 

which also depends on the initial condition (see (3.10)). The same kind of asymptotic 
results holds more generally for the GPA model, where the growth exponent 1/2 of the 
BA model is replaced by the continuously varying exponent v — l/(c + 2). 

Our results on the other observables, especially those concerning the whole history 
of a growing network (number of lead changes, of distinct leaders, lead persistence 
probability), are the outcome of a blend of heuristic reasoning and accurate numerical 
work. In the scalefree case (BA and GPA models), the present study confirms and 
makes more quantitative the picture emerging from the pioneering work of Krapivsky 
and Redner [8]. The finiteness of (I)oo and (-D)oo means that a typical infinite history 
of a scalefree network involves finitely many distinct leaders, which are chosen among 
the oldest nodes. For the BA model, the numerical results of Section 3, recalled in 
Table 1, show that the number of distinct leaders is in fact quite small (around two on 
average), whereas the index of the leader is hardly larger (around three on average). 
The entire history even consists of a single lead with an appreciable lead persistence 
probability (around one third). As already underlined in [8], this is a strong form 
of the rich- get-richer principle. 

The lead persistence probability and other key quantities exhibit a weak but 
significative dependence on the initial condition, as demonstrated by the data of Table 1. 
The feature that scalefree networks remember their whole past forever, and especially 
their infancy, already visible in finite-size effects on the degree statistics, is thus fully 
confirmed general phenomenon. 

Finally, the study of the statistics of leaders and lead changes can be expected to 
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shed some new light onto more complex models. We think especially of the Bianconi- 
Barabasi (BB) model [18], an extension of the BA model where the attachment rule 
involves both dynamical variables (the node degrees ki(n)) and quenched disordered 
ones (the node fitnesses rji). The BB model has the remarkable feature that it may 
exhibit a continuous transition, somewhat analogous to the Bose-Einstein condensation, 
or to the condensation transitions which take place in classical stochastic models 
such as the zero-range process (ZRP) [19]. It is convenient to parametrize the node 
fitnesses as activated variables: rji = exp(— Ei/T), where the activation energies E{ are 
drawn from a temperature-independent distribution. The BA model is recovered in 
the infinite-temperature limit, whereas the opposite zero-temperature limit yields an 
interesting record-driven growth process, investigated at length in [20]. Depending on 
the distribution of the £j, the BB model may have a low-temperature condensed phase 
below some finite condensation temperature T c . The dynamics of the model in its 
condensed phase, already tackled in [21], is not fully understood yet. The statistics of 
leaders seems to be an appropriate tool, as the condensate should naturally appear as 
a long lived leader. We intend to return to this problem in a near future. 

Appendix A. Extreme-value statistics for geometric integer variables 

This Appendix gives a self-contained exposition of various results concerning extreme- 
value statistics for integer variables. The theory of extreme-value statistics is most 
currently presented in the case of real variables with a continuous distribution [12]. The 
peculiarities of discrete (e.g. integer) distributions have been underlined in [22]. 

After some general formalism, we study more extensively the case of a large 
collection of n independent integer variables ki (i — 1, . . . , n), referred to as degrees, 
drawn from the common geometric distribution 



The parameter a can assume any value in the range < a < 1. The stationary degree 
distribution (2.2) of the UA model is recovered for a = 1/2. We consider successively 
the largest degree, /c max (n) = max(A; 1 , . . . , k n ), and the number C(n) of co-leaders, i.e., 
the number of node indices i such that k{ = k mSuX (n). 

Al. Distribution of the largest degree 

Let us start with some general formalism. The derivation of the distribution of the 
largest degree k max (n) follows the usual line of reasoning of extreme- value statistics [12]. 
Introducing the cumulative distribution 



f k = (1 - a)a' 



k-i 



(*>1). 



(A.I) 



CO 



F k = Prob{£;, >k} = ^f. 



(A.2) 



j=k 



so that 



fk — F k — F k+1 , 



(A.3) 
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we have 

Prob{A; max (n) < k} = (1 - F k ) n . (A.4) 
The distribution of the largest degree, <p k = Prob{/c max («.) = k}, thus reads 

<f> k = (l- F k+1 ) n - (1 - F k ) n . (A.5) 
Using (A. 3), this expression can be expanded as 



n 'n 



The integer C > 1 is to be interpreted as the number of co- leaders (see (A.25)). 

In the present situation of interest, i.e., the geometric distribution (A.l), the above 
general formulas read 

F k = a*- 1 (A.7) 

and 

4> k = (1 - a k ) n - (1 - a^ 1 )". (A.8) 

Hereafter we are mostly interested in large values of n. In this regime, the latter 
expression can be 'exponentiated' as follows: 

fc ne-™" -e- nak ~\ (A.9) 

The mean value of k max (n) can thus be recast as 

oo 

(AwM> « E k (e~ nak ~ e" nafc "). (A.10) 

k=l 

The sum is dominated by the values of k such that na k is of order unity. We thus obtain 
the estimate 

(Aw(n)) ~ ■ (A.11) 
A more refined analysis goes as follows. Consider the generating function 

oo 

G n (s) = ( e sfcm "( n )) « e sk (e~ nak - e"™^ 1 ). (A.12) 

k=i 

The estimate (A. 11) suggests to set 

n = a~^, i.e., = j + £, (A.13) 

|ma| 

with j integer and < £ < 1 for definiteness, and 

k = j + m . (A. 14) 

With these notations, the expression (A.12) can be recast as 

G n (s)«A(s,£K /|lna| , (A.15) 

with 

oo 

A(s,0= E e^-^e-^-e-^- 1 -'). (A.16) 
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The amplitude A(s,£) is a periodic function of £, with unit period. Its Fourier series 
can be derived by means of the Poisson summation formula, in the form 

OO OO /'OO 

E /("»-0= E f(x)e 2 ™ x dx. (A.17) 

m=— oo n=— oo 00 

We thus obtain 

00 

A(s,0= E M*)e 27ri< , (A.18) 

ra=— 00 

with 

, , e s — 1 fs + 27rin\ 

The oscillations in the amplitude A(s,£), described by the Fourier modes with n^O, 
are extremely small, except if a is very close to zero. The Fourier coefficients indeed fall 
off exponentially as |a n (s)| ~ exp(— n 2 \n\ / |lna|), irrespective of s. For a = 1/2, our 
case of interest, this estimate reads |a n (s)| ~ (6.549 x lCT 7 )'"', so that even the first 
amplitudes (n = ±1) are very small. 

Asymptotic expressions for the mean and the variance of the largest degree can be 
obtained by expanding (A. 15) to second order near s = 0. Neglecting the tiny periodic 
oscillations in £, i.e., replacing the full function A(s,£) by the constant amplitude ao(s), 
we are left with 

, In n + 7 B 1 , / \ 71-2 1 / » \ 

(fcmaxW) w — r: j— + -, var/c max (n) w — — + — , A.20 

|ma| 2 b{ma) z 12 

where 7 E ~ 0.577215 denotes Euler's constant. 

It is worth comparing the present problem to its continuous analogue, namely the 
distribution of x max (n), the largest of a large number n of i.i.d. random variables Xi > 0, 
with exponential density f(x) = \e~ Xx , with the identification A = |lna|. This is a 
standard problem of extreme- value statistics [12]. Setting 

* max (n) = ^±1, (A.21) 
the reduced variable £ obeys a Gumbel law with density 

P ? (0 = e- ? - e " 5 . (A.22) 
We have (e s5 ) = T(l — s), so that (£) = 7 E and var£ = 7r 2 /6, hence 

lnn + 7 E 7r 2 
(WW) ~ ^ , varx max (n) w — . (A.23) 

These results coincide with the first terms of the expressions (A.20), whereas the second 
terms there, involving rational numbers, are intrinsic effects of discreteness. 

We will refer to the distribution (A. 9) as a discrete Gumbel law. Let us now 
turn to other features of this distribution. Using the definitions (A. 13), (A. 14), the 
expression (A. 9) can be recast as 

fc ^e" a -e" a . (A.24) 
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(i) 




Figure Al. Plot of the highest probability </>hi g h(£) of the largest degree against £ for 
geometric integer variables with a = 1/2. Symbols: maximum and minimum values 
</>high,max and 0high,min, respectively corresponding to situations (i) and (ii) described 
in the text. 



The full distribution exhibits periodic oscillations in £, with unit period. These 
oscillations have been shown to be very small in the case of global characteristics, such 
as the mean and variance. They are however more visible in the shape of the distribution 
near its maximum. This shape indeed oscillates between: 

(i) a symmetric triangular form (• •) with a well identified most probable integer, 

(ii) a fiat tabular form (• •) with two equally probable integers. 

This phenomenon is best illustrated by considering the highest probability 0hi g h, 
i.e., the probability of the most probable integer m. This quantity is again a periodic 
function of £, with unit period. It can be checked that the highest probability is reached 
for m = 1 (i.e., k max (n) = j + 1) if < £ < £ c and for m = 2 (i.e., /c max (n) = j + 2) 
if £ c < £ < 1, where the threshold value £ c depends on a. Situation (ii) corresponds to 
£ — £o where 0hi g h(£) takes its minimal value </>hi g h,mm, the two equally probable integers 
being j + 1 and j + 2. Situation (i), where 0hi g h(£) takes its maximal value 0hi g h,max, 
takes place near £ £ c — 1/2. Figure Al shows a plot of h i g h against £ for a = 1/2. 
The minimum 

</>hi g h,min = V$ - 2 w 0.236067 is reached at the threshold £ c w 0.944744, 
whereas the maximum 0hi g h,max = 1/4 is reached for £ 0.471233. 

A 2. Distribution of the number of co-leaders 

The distribution of the number of co-leaders C(n) can be obtained as follows. The 
expansion (A. 6) allows one to readily write down the joint probability of k max (n) 
and C(n) as 

Prob{A; max (n) = k, C(n) = C} = (™) /£ (1 - F fc )"" c , (A.25) 
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so that the distribution pc{C) = Prob{C(n) = C} of the number of co-leaders reads 

Pc(C) = fypfi(l-F k )»- G . (A.26) 

Using (A.l), (A. 7), the 'exponentiation' valid at large n, and the definitions (A. 13), 
(A. 14), this expression can be recast as 

Pc(C)* { -^-^- £ at-^e-^ 1 - 5 (A.27) 

' m=— oo 

The sum again defines a periodic function of £, depending on the value of C, with unit 
period. Neglecting tiny periodic oscillations, we are left with the following asymptotic 
distribution of the number of co-leaders: 

Pc(C) = ^ 0-^, (A.28) 
| In a | C 

known as a logarithmic distribution. In particular, the asymptotic mean number of 
co-leaders reads 

(C) = (A.29) 
a \ma\ 
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